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Abstract 

We give a derivation of the Einstein equation for gravity which employs a definition of the 
local energy density of the gravitational field as a symmetric second rank tensor whose value 
for each observer gives the trace of the spatial part of the energy-stress tensor as seen by that 
observer. We give a physical motivation for this choice using light pressure. 
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1 INTRODUCTION 

Since Einstein's and Hilbert's original "derivations" of the Einstein equation for gravity in classical 
general relativity (CGR), there have appeared too many to list. The many different types of 
derivations are summarized in [15J . In fact, all these subsequent derivations as well as Hilbert's 
original derivation contrast markedly from Einstein's original derivation in that they appeal to 
some abstract mathematical principal which though desirable, is usually not justifiable beyond 
mere desire. For instance, one of the most popular derivations simply modifies one side of the 
equation to make it have zero divergence on grounds that physical considerations make the other 
side, the energy-stress tensor, have zero divergence. On the other hand, in Einstein's original 
derivation, [JJ, we see the realization that mathematically the Ricci tensor should be proportional 
to the source which should be the total energy density due to both the energy-stress tensor as 
well as the gravitational field itself. However, in [JJ, Einstein was not able to arrive at a tensor 
expression for the energy density of the gravitational field. Instead, he arrived at a doubly indexed 
quantity which he admitted was not a tensor, but rather a pseudo-tensor defined in terms of the the 
connection coefficients and which served to give the energy of the gravitational field for purposes 
of deriving the equation. As the arguments in [3] leading up to the development of CGR show, 
Einstein was clearly thinking of the energy of the gravitational field in a Newtonian way, since in 
particular, the connection coefficients are the generalized gravitational forces from the Newtonian 
viewpoint. Moreover, in [3], Einstein was very clear that his equation was using the energy density 



of the gravitational field in addition to the energy-stress tensor as the total source of gravitiy. In 
fact, subsequent attempts to mathematically characterize the energy of the gravitational field have 
all basically clung to the Newtonian framework which makes the energy of the gravitational field 
a function of a non-local arrangement of masses and energies. So much so, that these views are 
now taken for granted to the point that in [9] we have the claim of the impossibility of existence of 
a local energy density tensor for the gravitational field (see also |17j,[6j,[5j,|2j,[16j). This attitude 
clearly persists to the present as expressed, for instance, in chapter 3 of |16| . The result has been 
a profusion of mathematically inspired notions of quasi- local mass, which all have their advantages 
and drawbacks as discussed in [16] and [8] , along with extremely involved analysis required to arrive 
at their basic properties. Fortunately, I was unaware of these problems when I set out to find the 
energy density of the gravitational field in order to derive Einstein's equation. By contrast, we can 
realize the physical representation of the local energy density of the gravitational field by thinking 
of the physics of the gravitational field in relativistic terms. When we do, we see that the resulting 
second rank tensor added to the energy-stress tensor can serve as the total energy density source for 
the gravitational field which then must be proportional to the Ricci tensor. The result in particular 
immediately gives the Einstein equation. The trick is to adopt a truly relativistic attitude towards 
what disturbance of the gravitational field entails, and using laser light pressure as a standard, 
to relate the disturbance back to the energy-stress tensor itself. In particular, we find that the 
divergence of the energy-momentum-stress tensor due to matter and fields other than gravity must 
be zero as a consequence of our derivation. 

2 THE RICCI TENSOR AND DIVERGENCE 

We assume that our spacetime is a 4-manifold M equipped with a Lorentz metric tensor g, with 
signature (—,+,+,-1-), and we denote by V the resulting Kozul connection or covariant differenti- 
ation operator on M. We use T m M to denote the tangent space to M at m £ M. It is convenient 
in this setting to refer to u £ T m M as a unit vector to mean merely \g(u, u)\ = 1. We have then 
the Riemann curvature operator, 7Z, given by 

K(u,v) = [V u ,V v ]-V [uM , (2.1) 

where u and v are any tangent vector fields on M. We note that lZ(u, v) actually defines a vector 
bundle map of the tangent bundle TM to itself covering the identity map of M, and it as well then 
determines the Riemann curvature tensor, R, of fourth rank, which means that 7Z is itself a linear 
transformation valued tensor field on M. One of our main concerns is the certain contraction of R 
known as the Ricci tensor, Ric. In fact in any frame at m E M with basis e a for T m M and dual 
basis uj a , We have, using the summation convention, 

Ric(u, v) = u a (R.(e a ,u)v), u, v G T m M. (2.2) 

Among the many symmetries of the Riemann curvature tensor is the fact that Ric is a symmetric 
tensor. 

In order to see how the Ricci tensor enters into the theory of gravitity, we should recall the 
equation of geodesic deviation. If [—a, a] and [—b, b] is a pair of intervals in M., then a Jacobi 
field is a smooth map J : [—a, a] x [—b, b] — > M such that for each fixed s £ [—a, a] the map 
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J s ; [—6,6] — > M, given by J s {t) = J(s,t), is a unit speed geodesic in M. We can then form local 
vector fields e, u on an open neighborhood of the image of J in M, denoted Im J, so that 

e(J(s, t)) = d s J(s, t), u(J(s, t)) = d t J(s, t). (2.3) 
Thus we must have [e,u] = and V u u = 0, on Im J, so we find 

TZ(e,u)u = -V u V e u. (2.4) 

We will call e in this situation a tangent Jacobi field along Jo. In fact, given m a point on Jo and 
any unit vector e m € T m which is orthogonal to u(m), we can arrange that e(m) = e m . Since our 
connection is assumed to be the unique torsion free metric connection, we have [e, u] = V e u — V u e, 
so the condition that [e, u] = gives V ' e u = V u e in our present case. In view of (|2.4p . we then find 
the equation of geodesic deviation on Im J. 

K(e,u)u + S/ u V u e = 0. (2.5) 

Now the term V^V^e should be interpreted as the rate of change of separation acceleration in 
direction e of infintesimally separated geodesies. If 5s is a small change in the parameter s, then 
we can think of (5s)e as representing the separation between the geodesic Jo and the geodesic Js s - 
Thus V u [(<5s)e] represents the rate of change of separation from J$ s as seen by an observer moving 
along Jo. Then X7 u V u [(5s)e] represents acceleration in separation between the observer following 
Jo and the geodesic Js s - Dividing by 5s then gives the rate of change in the direction e of the 
separation acceleration. This means that g(e,V u V u e) = g(e, IZ(e, u)u) is a "term" in a typical 
spatial divergence calculation, in this case of the acceleration field. That is, it is the e— component 
of the rate of change of separation acceleration in direction e. If u is assumed time-like, it follows 
from (|2.2p that Ric(u, u) is in fact giving an invariant form of a spatial divergence of the separation 
acceleration field for infinitesimally close geodesies to Jo, as would be seen by an observer following 
along Jo. To be clearer about this, an observer following Jo could view the separation of nearby 
geodesies as a position vector field on n -1 -, the orthogonal complement of his velocity vector, so the 
change in separation would then be viewed as the spatial velocity of nearby points (test particles 
suspended) in his space, so its rate of change is the acceleration field a u as seen by the observer. 
We then have for e £ T m M that 

a u (e) = V u V u e, eB i C TM. (2.6) 

If (ei, 62, £3) is a smooth frame field along Jo for the orthogonal complement of the time- like unit 
vector field u along Jo, which is orthonormal at m on Jo, then from (|2.2[) and (|2.6p . we have at m, 

Ric(u,u) = -[g(ei,V u S/ u ei) + g(e 2 , V n V u e 2 ) + g(e 3 , V M V u e 3 )] = -div u a u , (2.7) 

where a denotes the spatial separation ( compare [TO]. 8.9, page 219 and 4.2.2, page 114) 
acceleration field around Jo, and div u a denotes the spatial divergence of this spatial vector field 
around Jo. At this point, one might object that the observer could be rotating which would introduce 
fictional acceleration into a u , and that is correct. A more sophisticated analysis here could deal 
with this purely mathematically (see for instance [5] , [9] , or [15] ) , but let us allow that the observer 
can feel if he is rotating and just say he restricts to cases where he is not rotating in order to carry 
out his measurements. Continuing then, for a non-rotating observer, the separation acceleration 
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field in a geometric theory of gravity is the essence of the gravitational field. That is, if an observer 
at event m G M has velocity u, with g(u,u) = —1, then according to (|2.5|) . (12. 6p . and (|2.7p we 
should interpret R(u, u) as the negative divergence of the gravitational "force per unit mass" field as 
seen by that observer at m G M. Now, in ordinary vector analysis, the divergence of an irrotational 
vector field, divided by 4-7T, is its source density, and we know that the source density for gravity is 
energy density. Therefore there should be a universal positive constant G so that (l/4:ir)Ric(u, u) 
is equal to the energy density observed by this observer multiplied by G, as gravity always acts 
as an attractive influence between objects. But, thinking relativistically, if the gravitational field 
itself has energy, that energy density must be included in the source total energy density. Thus, 
the question now arises as to the energy density of the gravitational field itself. 



3 THE ENERGY DENSITY OF THE GRAVITATIONAL FIELD 

In order to deal with the energy density of the gravitational field, we must first think in terms of 
the basic assumption of the geometric notion of gravity which is that "free test" particles must 
follow geodesies. If this is the case, then from the point of view of the gravitational field itself, it is 
happiest when all particles are following geodesies. In fact, we can imagine that in a limiting sense, 
if "all particles" follow geodesies, then the gravitational field is completely relaxed and contains 
no energy. It is only when we try to push a particle off of its geodesic that we feel the reaction of 
the gravitational field, and notice we feel it right at the location of the event of trying to push the 
particle off of its geodesic. Thus, relativistically, we should think of the manifestation of tension 
in the gravitational field is particles not following geodesies. Now, if a particle is not following a 
geodesic, then it is because it is being acted on by a force which is not part of the gravitational field 
itself. Because by definition, gravity acts only through causing particles to follow geodesies, in the 
absence of "outside" forces. When a force acts to move a particle a certain amount off of its geodesic 
path, the force required to do so is proportional to the particles inertial mass, by definition, but in 
essence, this says the gripping energy of the gravitational field at the point where the particle is 
located is somehow related to the mass of the particle. Accepting this, the force density should be 
manifested in pressures in various directions. That is, the energy-stress tensor tells us the pressures 
as seen by any observer in various directions, so from the energy-stress tensor itself, we should be 
able to find the energy density of the gravitational field. For instance, the pressure you feel on your 
bottom when sitting in a chair is a manifestation of the energy density of the gravitational field 
at those points on your chair. In a sense then, we could say that if the surface of your chair were 
replaced by an infinitesimally thin slab sitting on top of an infinitesimally lower chair, then the 
mass energy of the slab required to hold you in place divided by the volume of the slab is a reflection 
of the energy density of the gravitational field. What is the minimum mass which can take care of 
this job? In fact, the material the chair is made of in some sense is a reflection of the energy density 
of the gravitational field right where your chair is located. Even primitive people have an intuitive 
idea of the strength of material needed to make a chair, and thus have a working idea of the energy 
density of the gravitational field. We should therefore think of the least mass energy of material 
required to make a chair as a rough measure of the energy density of the gravitational field where 
the chair is to be used. More generally, imagine an observer located at m G M, ghost-like inside a 
medium with energy-stress tensor T and suppose that his velocity at m is u. Then exponentiating 
u 1 - C T m M, the orthogonal complement of u in T m M, the pressures are given by the restriction of 
T m to u 1 - x u- 1 . This is a symmetric tensor on a Euclidian space so can be diagonalized. Notice 
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that this does not mean T itself is diagonalizable. Thus, there is an orthonormal frame (e x , e y , e z ) 
for u 1 - with the property that T(e a , e^) = p a 8Z, for a, b € {x, y, z}. Imagine scooping out a tiny box 
in M at m whose edges are parallel to these principal axes of this spatial part of the energy-stress 
tensor. We can imagine putting infinitesimally thin reflecting mirrors for walls of the box and filling 
the box with laser beams reflecting back and forth in directions parallel to the edges of the box 
with enough light pressure in each direction to balance the force from outside on these reflecting 
walls. In a sense, we have standardized a system to balance the pressures acting to disturb the 
gravitational field, so we define the energy density of the gravitational field as seen by our observer 
to be the energy density of the light in this little box. The fact that a photon has zero rest mass 
should mean that the light energy constitutes a minimum amount of energy to accomplish this task 
of balancing the gravitational energy. However, it is an elementary problem in physics to see that 
the energy density of the light along a given axis is exactly the pressure in that principal direction. 
Let us review this simple argument. Assume the coordinates are (t,x,y,z) for simplicity and the 
box edges are parallel to these axes with lengths 5x, Sy, 5z, respectively. Assume that the laser 
beams parallel to say the x— axis contain N x photons, each having spatial momentum P x . In time 
St, the photons travel a distance of cSt and hence each such photon makes (cSt) / (Sx) reflections for 
a change in momentum of 2P X for each reflection. Thus the total momentum transfer to the two 
end walls perpendicular to the x— axis for the laser beams paralleling the x— axis is 

2P x N x c5t . . 

—fa— ("J 

This means that the force exerted on the two end walls is (2P x N x c)/(Sx). But the total area of the 
two end walls is 25y5z, so the pressure on the end walls is 

P, = (3.2) 

where V = 5x5ySz is the volume of the box. But the relativistic energy of a photon with momentum 
P x is P x c. Therefore, the total energy density due to the x— axis beams is exactly the pressure in the 
x— direction on the walls perpendicular to the x— axis. Likewise for the other two axes, consequently 
we see that the energy density in the box is the sum of the pressures that the beams are balancing, 
that is the trace of the observer's spatial part of the energy-stress tensor, p x + p y + p z . 



4 THE ENERGY DENSITY TENSOR 
OF THE GRAVITATIONAL FIELD 

In this section, and the next, we will be applying what we will call the symmetric tensor observer 
principle (see Appendix on Analytic Continuation). In our situation, as applied to second rank 
tensors, it means that if A and B are both symmetric second rank tensors at m € M, and if 
A(u,u) = B(u,u) for all unit time-like vectors, it follows purely mathematically, that A = B. This 
is the essence of the Principle of Relativity as applied to second rank symmetric tensors-a law 
(at m), say A = B, should be true for all observers (at m) and conversely, if true for all observers 
(at m), that is if A(u, u) = B(u, u) for all time-like unit vectors u G T m M, then it should be a law 
(at m) that A = B. We are going to obtain the Einstein equation by simply observing that if H is 
the symmetric tensor giving the total energy density at m G M, then (l/4irG)Ric(u, u) = GH(u, u) 
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for each vector u G T m M such that g(u,u) = — 1. First, let c(A) denote the contraction of A for 
any second rank covariant tensor A. Thus using the summation convention, we have 

c{A)=g^A a(3 . (4.1) 

Now suppose that u G T m M is the velocity of an observer at m G M. Suppose that T is the second 
rank covariant energy-stress tensor. We can define the projection tensor P u : T m M — ► T m M by 
Pu{w) = w + g(w,u)u, for any w G T m M. Note here that we are not assuming w is time-like or 
space-like, it is just any vector in T m M. We easily check that P u is linear, that P u oP u = P u , and that 
P u (T m M) = u 1 - C T m M. For instance, if w G T m M, then g(u, P u (w)) = g(u,w) + g(w, u)(— 1) = 0, 
so P u {T m M) C u . On the other hand, if w G u -1 -, then obviously P u {w) = it?, and therefore, 
P u {T m M) = ir 1 . Thus, A(v,w) = T(P u (v), P u (w)) is a symmetric tensor at m and restricted to 
u^~, this tensor can be diagonalized as g on u 1 - is positive definite. That is to say, there is an 
orthonormal frame (ej, e2,e%) for n -1 so that A(ei,ej) = \Sj. Thus, we see that the box argument 
of the preceding section shows that the sum of these diagonal values of A must represent the 
energy density of the gravitational field, namely, a = A(ei,e%) + A[e2-,^2) + ^4(e3,e3). Now, if 
we compute c(T) in the frame (u,e±,e2,es), we find that c(T) = —T(u,u) + a, and therefore, 
a = T(u, u) + c(T) = T(n, u) — c(T)g(u, u), which is to say finally that the second rank symmetric 
covariant tensor T g , where 

T g = T- c(T)g (4.2) 

represents the energy density of the gravitational field at m € M, because by the symmetric tensor 
observer principle, T g at m G M is completely determined by specifying T g (u,u) for each time-like 
unit vector u G T m M. 

5 THE DERIVATION OF THE EINSTEIN EQUATION 

Now, finally, from the fact that (l/Air)Ric(v,,u) is the divergence of the spatial gravitational accel- 
eration per unit mass, using (|4.2p our gravitation equation should be that for every timelike unit 
vector (that is every observer) at m G M, we have 

RlC ^ u) = G[T(u, u) + T g (u, u)) = G[T(u, u) + T(u, u) - c(T)g(u, «)]. (5.1) 

47T 

Since (|5.ip is homogeneous of order 2, it follows that the equation is true for u being any timelike 
vector, and hence by the symmetric tensor observer principle, we must have 

Rir 

— = G[2T - c(T)g] = 2G[T - (l/2)c(T)g}. (5.2) 

Let us denote the total energy-stress tensor by H = T + T g = 2T — c(T)g. In these terms, we have 
simply 

Ric = 4vrG H. (5.3) 

Equivalently, we have 

Ric = 8ttG[T - (l/2)c(T)g] (5.4) 
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which is a well-known form of Einstein's equation. As c(g) = 4 and c(Ric) = R, where as usual, 
R is the scalar curvature, we find that R = (8tt)G[—c(T)], so the equation can be also written as 
Ric = 8irGT + (l/2)Rg, and this results immediately in the most familiar form of the Einstein 
equation 

E = Ric — (l/2)Rg = 8vrG T, (5.5) 

where E = Ric — (l/2)Rg is the Einstein tensor. Notice that we have not used local conservation 
of energy, divT = 0. Since the left side of (|5.5|) . the Einstein tensor, E, is divergence free, we find 
divT = as a consequence of our derivation. 

We can observe that our derivation required the assumption that the pressures are positive in 
order for the laser light box argument to justify the expression on the right side of (|5.2p as the 
total energy density tensor of the matter fields and gravitational fields, but the development is so 
general at this point, that it seems reasonable that the expression (I4.2D should be regarded as the 
energy density of the gravitational field in all cases. This means that the total energy density is 
the expression H on the right hand side of (15.3|) . 

H = T + T g = 2T - c{T)g = 2{T - {l/2)c{T)g\. (5.6) 

Reconsidering T g = T — c(T)g, it is probably more natural to think of the gravitational field's 
energy-stress tensor, T g as a function of the metric tensor in some way. For this we just use the 
Einstein equation itself. Since c(T) = (—1/8ttG)R, from (|5.5j) we immediately conclude that 

T g = (l/8nG)[Ric + {l/2)Rg\ = (l/8nG)(E + Rg). (5.7) 

From the last expression on the right, we see, as T and the Einstein tensor, E = Ric — (l/2)Rg 
both have zero divergence, that 

divH = divTg = (l/87rG)div(Rg) = (l/8ivG)dR = -d[c(T)]. (5.8) 

So even though the total energy and gravitational energy are not infinitesimally conserved, the 
divergence is simply proportional to the exterior derivative the scaler curvature. Of course, divT g = 
—d[c(T)] is obvious from the definition, (14. 2h . once we accept divT = 0. In particular, as d 2 = 0, 
this means that 

d[divH] = d[divT g ] = 0, (5.9) 

but (|5,8p is even better as it shows divT g is an exact 1-form on M. On the other hand, the equation 
(|5.8p . when written 

divT g + d[c(T)] = (5.10) 

has another interpretation. In classical continuum mechanics written in four dimensional form of 
space plus time, the divergence of the energy stress tensor equals the density of external forces. Of 
course in relativity, the energy stress tensor T contains everything and there are no external forces, 
as gravity is not a force. But, we can view (|5.10p as saying that from the point of view of the 
gravitational field, the matter and fields represented by T are acting on the gravitational field as 
an external force density of —d[c{T)\. In classical continuum mechanics, the external force density 
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has zero time component, but relativistically such is not the case, the force only has zero time 
component in the instantaneous rest frame of the object acted on. We can therefore view (|5.10p as 
saying that the divergence of the the gravitational field's energy stress tensor is being balanced by 
the rate of increase of — c(T). If p x ,Py,Pz are the principal pressures in the frame of an observer 
with velocity u, where g(u,u) = — 1, then p = T(u,u) is the energy density observed, and divT g (u) 
is then the power loss density of the gravitational field. Now c(T) = — p + p x + p y + p z , so (|5.10l) 
becomes 



divT g (u) = D u p- D u [p x +Py+p z }- (5.11) 

Thus, the observer sees the divergence of energy of the gravitational field is exactly the rate of 
increase of energy density of the matter and fields less the rate of increase of principal pressures. In 
particular, in any dust model of the universe (pressure zero), the gravitational energy dissipation is 
exactly balanced by the rate of increase of energy density of the matter and fields. If T is purely the 
electromagnetic stress tensor in a region where there are only electromagnetic fields, then c(T) = 0, 
and the gravitational energy-stress tensor has zero divergence, so is then infinitesimally conserved. 



6 THE GRAVITATION CONSTANT G 

So far, we have not said anything about the determination of the gravitation constant G. To evaluate 
this, we merely need to check the results of experiments with attractive "forces" between masses. 
But it is much simpler to just use Newtonian gravity in an easy example where the results should 
be obviously approximately the same. Consider an observer situated at the center of a spherical 
dust cloud of uniform density p, and calculate the separation acceleration field using Newton's 
law of gravitation. At distance r from the center, but inside the cloud, the mass acting on test 
particles at radial distance r is simply the mass inside that radius, M(r), by spherical symmetry, 
as is well-known in Newtonian gravitation. Here, we have M(r) = (4/3)-7rr 3 p. But Newton's Law 
says the acceleration of a test mass near the center of the dust cloud is radially inward, and if r is 
the distance from the center, then the radial component of acceleration is given by 

M(r) Anpr 
a r (r) = -G N — 5— = -G N ——. (6.1) 

Here, Gn is the Newtonian gravitation constant. On the other hand, considering an angular 
separation of 9, the spatial separation is s = r6, so the relative acceleration of nearby test particles 
in the s— direction perpendicular to the radial direction is therefore 

a s (r) = 9a r (r) = -G N ^- = -G N ^f. (6.2) 

Thus the rate of change of separation acceleration of nearby radially separated test particles in the 
radial direction at given r is by (|6.2|) . 

(6.3) 

whereas in the s direction we have the rate of change of separation acceleration is 

da s 4irp 

-dF = ~ Gn —> (6 ' 4) 
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the same result again. But there are two orthogonal directions perpendicular to the radial, so now 
we see that if a n denotes the spatial acceleration field around our observer at the center of the dust 
cloud, then 

div u (a u ) = -G N 4np. (6.5) 

As we are dealing with dust, the pressures are zero, so there is no gravitational energy density, 
and thus p is now the total energy density seen by our observer. Thus, we have by (|2.7p . that 
R(u,u) = Gn^P = 4-7rGiv H(u,u). But now comparing this result with (|5.3p . we see that we 
must have G = Gn- Notice that in our development, we have used the symmetric tensor observer 
principle as a form of the principle of general relativity to reduce everything to working with the 
time component in an arbitrary frame for the tangent space. The trick is to be able to work 
completely generally so that conclusions apply to Too and Ricqq no matter the coordinate frame, 
which seems best expressed by using T(u, u) and R(u, u), to remind us that we are dealing with an 
arbitrary time-like unit vector. It is only now at the end once we have Einstein's equation that we 
allow a calculation in a special frame in order to evaluate the gravitation constant. 

At this point let us discuss for a moment the derivation of Einstein's equation given in [5]. In 
effect, the derivation of the Einstein equation given in [5] uses the analysis of the special case of a 
static arrangement of mass for a gravitating fluid drop and adds the Newtonian energy density of 
the fluid drop as expressed in terms of pressure through the requirement that its surface pressure be 
zero to get the time component of the Einstein equation. Since the setup is a special arrangement 
of mass, one cannot assert the symmetric tensor observer principle, because the only observer for 
which the equation works is the special observer moving with the drop. However, one can appeal 
to the general covariance desire of relativity that equations should be tensor equations valid in all 
frames, from which one surmises that if you have found an equation relating the time components 
in a special frame, then the other components in that special frame should also be equal. Once 
you accept the full tensor equation in any frame, then it is valid in all frames and you next surmise 
that if it works for the liquid drop, then it must work in general. But, in our present situation, 
we have the full equation, and can simply go backwards through the development in [5] to see that 
the time component of the equation in the liquid drop case is Newton's law, and therefore again 
conclude that our G in (|5.5p is identical to the Newtonian gravitational constant. For a treatment 
of linearized Einstein gravity and its Newtonian approximation in general, on can consult [9] or 

m 

At this point, we can simply choose units such that G = 1 and we henceforth drop this factor 
from the equation for simplicity. 

7 ENERGY CONDITIONS 

Since we have an expression for the total energy density H = T+T g , we could surmise that in general 
it would be reasonable to have H (u, u) > 0, for every time-like tangent vector u G TM. But, this is 
the same as requiring that 2T(u,u) > — c(T), a condition known as the strong energy condition. In 
formulating the various energy conditions, it will be useful to denote A = A — (l/2)c(A)g, when A 

is any second rank covariant tensor. For instance, we observe easily that A = A, and Ric = E. We 
say A is observer non-negative definite if A(u, u) > 0, for every time-like vector u, whereas we say 
that A is dominantly non-negative if A(u, v ) > whenever u and v are vectors with g(u, v) < 0. 
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Picking a future half of the light cone arbitrarily at m E M, we note that the future light cone 
is an open subset of the tangent space T m M, and therefore there is a basis for the tangent space 
consisting of future time-like unit vectors, say ui, U2, U3, u^. In this frame we have A a p > and 
9a.fi < 0. If A denotes the transformation of TM uniquely defined by g(A(u),v) = A(u,v) for 
all vectors u, v over the same base point, then saying A is dominantly non-negative is equivalent 
to requiring —A{u) be future time-like or null whenever u is future time-like, which is the usual 
statement of the dominant energy condition for A. The weak energy condition simply requires that 
T is observer non-negative definite, whereas the strong energy condition requires that T is observer 
non-negative definite. Thus, by the Einstein equation, the strong energy condition is equivalent to 
requiring that Ric = 8ttT be observer non-negative definite. As it is expected that the pressures 
and stresses are smaller than the mass energy, it is reasonable that c(T) < 0, so the weak energy 
condition is probably weaker than the strong energy condition. Notice the strong energy condition 
is really completely geometric, as it says simply Ric(u, u) > for every time-like vector u, and Ric 
can be determined by the connection without reference to the metric. We say that the energy-stress 
tensor T satisfies the dominant energy condition provided that T is dominantly non-negative. Now, 
H = 2T — c(T)idTM- Thus, the dominant energy condition holds for T, if and only if —H{u) — c{T)u 
is future time-like or null for any future time-like vector u € TM. However, thinking of Ric = 4ttH 
with H the total energy, it would now seem reasonable to require that H and hence also Ric be 
dominantly non-negative. 

8 THE COSMOLOGICAL CONSTANT 

If we include the cosmological constant A in the Einstein equation, it becomes 

Ric-(l/2)Rg + Ag = 8irT, (8.1) 

which is of course the same as 

Ric - (l/2)Rg = 8vr[T - (l/8vr)A 5 ], (8.2) 
which means we view the equation here as having a modified energy-stress tensor 

T A = T- (1/8tt)A 5 . (8.3) 

We then have c(T\) = c(T) — (l/27r)A, so the effective energy-stress tensor of the gravitational 
field is 

T g = T- c(T)g - (1/2tt)A 5 , (8.4) 
and the effective total energy-stress tensor serving as source is 

H = 2T- c(T)g - (1/2tt)A 5 . (8.5) 

In any case, as div g = 0, it follows that our conclusions about the energy-momentum flow of 
the gravitational field from (I5.10p and (I5.11|) remain valid, even in the presence of a cosmological 
constant. 
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9 QUASI LOCAL MASS 



The problem of defining the energy contained in a space-like hypersurface has led to many different 
definitions of the mass enclosed by a closed space-like surface contained in an arbitrary spacetime 
manifold, and these go by the general name quasi-local mass. Typically, they are defined by some 
kind of surface integral and give an indication of the mass enclosed by the space-like surface. For 
an extensive survey of these we refer the interested reader to [16J . In particular, the results of [13] 
on the Penrose quasi-local mass show that the results can be interesting when the space-like surface 
is not the boundary of a space-like hypersurface. A list of desirable properties of any definition of 
quasi-local mass is given in [8], where in particular it is shown that for their definition, the quasi- 
local mass enclosed by a space-like surface S is non-negative provided that the dominant energy 
condition holds and the surface S is the boundary of a hypersurface, O. It is further assumed that 
the boundary surface S has positive Gauss curvature and space-like mean curvature vector, and 
consists of finitely many connected components. The local energy condition assumed is framed in 
terms of the second fundamental form of the hypersurface, and in particular, we can see that for a 
geodesic hypersurface it reduces to the condition that the scalar curvature of the hypersurface, fi, 
is non-negative, since in that case the second fundamental form vanishes (extrinsic curvature zero). 
But, the scalar curvature of the space-like hypersurface f2 is 2E(u,u) = 16irT(u,u), where u is a 
time-like future pointing unit normal field on Q. So if the energy-stress tensor satisfies the weak 
energy condition in this case, then the energy density as seen by observers riding the hypersurface 
is non-negative, and we would simply integrate (1/8tt)E(u,u) over the hypersurface to find the 
energy inside, which is clearly non-negative. The amazing result in [H] is that the quasi-local mass 
defined there is defined in terms of integrals over the boundary S. For instance, their results show 
if the energy inside any one component of S vanishes, then S is connected and Q is flat ([8], 
Theorem 1, page 183), and thus the result shows that the energy in Q is in some sense determined 
by the geometry of the boundary and its mean curvature vector under the assumptions stated 
above. In order to make use of the total energy-stress tensor, H, in a similar setting, one would 
assume an appropriate energy condition, and then for a space-like hypersurface K with future 
time-like unit normal field u, it is natural to consider H(u,u)hk where is the volume form 
due to the Riemannian metric induced on K. The integral of H(u,u)[j,k ° ver an of fl should be 
the total energy inside K. More generally, if we assume that H is dominantly non-negative, that 
is, it satisfies the dominant energy condition, then given another reference future pointing time- 
like vector field k, one might then integrate H(u,k)nx over K. If a 2-form a can be found on K 
satisfying da = H(u, k)/ijc, and if K is a 3-submanifold with boundary B, then by Stoke's theorem, 
the total energy inside K is the integral of a over the boundary B of K. In particular, we say that 
K is instantaneously static if there is an open set U C M containing K and a vector field k on U 
which is future pointing and orthogonal to K and which satisfies Killing's equation, at each point 
of K. If uo = k* is the dual 1-form to k, so u(v) = g(k,v) for all vectors v, then this is equivalent 
to requiring Sym(^7 uj)\K = or equivalently that (duo)\K = 2Vuo\K, which to be perfectly clear 
means that the difference dw — 2Vw as calculated on U in fact is zero at each point of K. Then as 
in the Komar [7j integral (see [31], [32], pages 287-289 or [T2], pages 149-151) it follows that 



Here, * denotes the Hodge star operator on M. Thus, (— l/8-7r) * du; is a potential for the total 
energy on K. For any closed 2-submanifold 5 of if we define the quasi-local total energy H(S, k) 
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by 

H(S,k) = -±- [ *d(k*). (9.2) 
8?r Js 

Thus, if Kq C K is a submanifold with boundary S = OKq, then by Stoke's Theorem, (|9,2p becomes 

H(S,k) = ~ f d*d(k*)= f H(u,k)nK, (9.3) 

which is then non-negative if H is observer non-negative definite. Of course, this is the same as 
requiring Ric be observer non-negative definite, a purely geometric requirement. Thus, if H (S, k) = 
0, with S = 8Kq, then by (19. 3p . under the assumption that H is dominantly non-negative, we 
would conclude that Ric(u,k) = on Kq. But, this means that Ric(u,u) = on Kq, and this 
means that the scalar curvature of Kq is identically zero. In particular, if Kq has constant sectional 
curvature, this would imply that Ko is actually flat. Notice that if we have an asymptotically flat 
spacetime with a global time-like killing vector field orthogonal to a spacelike slice, normalized to 
be a unit vector at spatial infinity, then our definition of the quasi-local total energy would be 
exactly the Komar mass which is well known in the literature [16] . Thus in the expression H(S, k), 
the normalization for k is determined by requiring that it be of unit length at the event at which 
the observer is located. If the observer is located so that S is in the observer's causal past, then 
it would seem we must assume that the domain of k contains this past light cone. In general, if 
k is a Killing field on all of the open set U, then being orthogonal to K means ([T7], page 119, 
(6.1.1)) that also to A duj = 0, where u> = k* . Then (see pT|, page 443, (C.3.12)) we find, using 
f = ln(\g(k,k)\), 

duj = —uj A df, (9.4) 

and using the fact that here *[u) A df] = —{e^ 2 D n f)^s-, where n is the outward unit normal to 
S = OKq, and fis = dA is the area 2-form on S, we obtain finally, 

H(S,k) = ~ [ ef' 2 D n fdA. (9.5) 
87r Js 

In particular, for the vacuum Schwarzschild solution with mass M, taking the Killing field k = dt, 
we see easily that the mass calculated using the integral (I9.5P gives the value M for the mass 
enclosed by any sphere centered at the "origin" when we normalize the Killing field to be a unit 
vector at infinity. On the other hand, if we calculate that value of the integral by normalizing 
to make the Killing vector a unit at radial coordinate ro, as H(S,k) is homogeneous in k, the 
normalizing constant comes out resulting in 

Mro = nw 11/2 - (9-6) 

Keeping in mind this is now the total energy, gravitational and massive, this indicates a problem 
develops as ro — > 2M, even though we know it is not a real problem for the spacetime. This indicates 
the problem is probably due to the normalization involving the Schwarzschild radial coordinate 
which obviously breaks down at ro = 2M. After all, what we are integrating is equivalent by 
Stoke's Theorem to integrating H(u,k)(J,Ki when S = OKq, and we really want to be integrating 
H(u,u)fJLK- We do not have the actual potential. 
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10 APPENDIX ON ANALYTIC CONTINUATION 



Suppose that 5 : E x E — ► F is any symmetric bilinear map of vector spaces. Let the quadratic 
function fg be defined by fg(x) = S(x, x). Then fg determines 5. This is just (what mathematicians 
would call) polar decomposition: 

S(x,y) = (l/A)[f s (x + y)-f s (x-y)}. (10.1) 

That is, we note that 

f s (x ±y) = f s (x) + f s (y) ± 2S(x, y), (10.2) 

so subtracting the "minus" equation from the "plus" equation of (|10.2|) gives (|10.1|) . In particular, 
if fg = 0, then 5 = 0. But more is true. For, suppose E is a topological vector space and that fg is 
constant on the open subset U C E. Choosing x G U and y sufficiently "small", we can assume that 
both x + y and x — y belong to U in which case we have S(x, y) = from the polar decomposition 
(jlO.ip . But from homogeneity, it follows that S(x,y) = 0, for every x € U, and every y £ E. Since 
5 is symmetric, it follows that S(y, x) = 0, for every x £ U, and every y £ E. Now, if x, y S E are 
any vectors, simply choose a vector x$ G U, and we have S(x,xq) = 0, so S(x,y) = S(x,y — xq), 
and we will see that S(x,y — Xq) = 0. For there is a small scalar t ^ so that xq + t(y — xq) belongs 
to U, and therefore, = S(x, xq + t(y — xq)) = S(x, xq) + tS(x, y — xq). Since S(x, xq) = 0, it now 
follows that S(x, y — xq) = 0, and thus finally we have S(x, y) = 0. The argument can be simplified 
by using differentiation (see for instance [15] . page 72), but we prefer to give a purely algebraic 
argument here. Using differentiation, we next generalize easily to n-linear maps, but a telescoping 
algebraic argument could be applied with a little more effort. 

Thus, if S is a continuous symmetric n— linear map of the Banach space E into the Banach space 
F, and if we define the monomial function fg : E — > F by the rule fg{x) = S(x, x, x, x) = Sx^ n > , 
then fg is an analytic function. In fact, if xi,X2,x%, ...,x n £ E, then differentiating we find 

D Xl D X2 D X3 ...D Xn f(a) = (n\)S(x 1 ,x 2 ,x 3 ,...,x n ), aeE. (10.3) 

From (|10.3p . we see very generally that if U is any open subset of E on which fg is constant, then in 
fact, 5 = 0, since we can choose a E U. Indeed, if a 6 U, since fg is constant on U, it follows that the 
derivative on the left side of the equation fjlQ.3f) is 0, and hence also the right side, for every possible 
choice of vectors x\, x%, X3, ...x n € E. But notice that a does not appear on the right hand side 
of (|10.3p . only S(xi, X2, x$, ■■■x n ), and the vectors x\, x 2 , x%, ...x n can be chosen arbitrarily. Thus, 
5 = follows. This is just a very special case of the principle of analytic continuation. However, 
to see (|10.3|) . it served our purpose for the Einstein equation to only examine the case where n = 2 
and the vector spaces are finite dimensional, so the vector topologies are unique and any bilinear 
map is therefore continuous and in fact smooth. In this case, we have for any x,w G E, the easily 
checked fact that 

D w fg(x) = S(x, w) + S(w, x) = 25(x, w). (10.4) 

Differentiating again, since as a function of x alone D w fg is obviously linear, it follows that for any 

x,v,w G E, 

D v D w f s (x) = 2S(v,w). (10.5) 
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It should be obvious how the general case of f)10.3j) from the method of showing (|10.5j) . In any 
case, clearly, if fg is constant on some (no matter how small) non-empty open subset U of E, then 
choosing x G U would give D v D w fs(x) = for every v,w G E and every x G U. Thus the right 
hand side of (|10.5p vanishes for every v,w G E and this says S = 0. In our application, we are 
taking the vector space to be the tangent space to M at a specific point, say m G M, so E = T m M. 
Then, either half of the light cone, future or past, is an open subset of T m M and therefore, if S 
is a symmetric second rank tensor at m whose quadratic form vanishes on the future light cone or 
vanishes on the past light cone, or on any small open subset of the light cone, then S = 0. It follows 
that in general relativity theory, symmetric tensors at a point can be specified by their monomial 
forms on the future or on the past light cone at that point, and hence it suffices, by homogeneity of 
the monomial forms to limit consideration to time-like unit vectors. We might say in fact that in 
general relativity, any tensor equation should be viewed as a tensor valued symmetric tensor, and 
on all slots for which there is symmetry one need only evaluate by using the same arbitrary unit 
time-like vector in each of those slots. For instance, if A(a, b, c, d,e) is a 5 — linear transformation 
which is symmetric in the first three slots, then considering A(a, b, c, _, _) as a third rank symmetric 
tensor which is second rank tensor valued, it is completely determined by knowing A(u, u, u, _, _) for 
every time-like unit vector u G TM. This is what we call the symmetric tensor observer principle. 
In some sense, this is the principle of relativity. If we apply this to the electromagnetic field, F, for 
instance, as it is anti-symmetric, that is, a 2-form, it should be viewed as a 1-form valued 1-form. 
Thus, it is determined by giving the 1-form F(u) for each time-like unit vector u. In fact, we can say 
that an observer with velocity u holding a test charge Q should feel the force QF(u), so F(u) is the 
force 1-form per unit charge experienced by an observer holding an electric charge. Since the force 
vector in the instantaneous rest frame is orthogonal to the velocity, we must have F(u, u) = here. 
As this experiment could be carried out by any observer, this means that F(u, u) = for every 
time-like unit vector, so Sym(F) = 0, by the symmetric tensor observer principle, and therefore 
F = Alt(F) must be a 2-form. We can similarly say that any force field which applies to certain 
objects called "charges", according to a scalar measure of that charge making force felt by any 
observer holding the charge exactly proportional to the charge, would be described by a 2-form. 
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